On automatic phonetic transcription quality: lower word error rates do not guarantee better transcriptions
Abstract
The first goal of this study was to investigate the effect of changing several properties of a continuous speech recognizer (CSR) on the automatic phonetic transcriptions generated by the same CSR. Our results show that the quality of the automatic transcriptions can be improved by using short hidden Markov models (HMMs) and by reducing the amount of contamination in the HMMs. The amount of contamination can be reduced by training the HMMs on the basis of a transcription that better matches the actual pronunciation, e.g., by modeling pronunciation variation or by training HMMs on read speech. Furthermore, we found that context-dependent HMMs should preferably not be trained on baseline transcriptions if there is a mismatch between these baseline transcriptions of the speech material and the realized pronunciation. Finally, we found that by combining the changes in the properties of the CSR, the quality of automatic transcription can be further improved. The second goal of this study was to find out whether a relationship exists between word error rate (WER) and transcription quality. As no clear relationship was found, we conclude that taking the CSR with the lowest WER does not necessarily provide the optimal solution for obtaining optimal automatic transcriptions. © 2003 Elsevier Ltd. All rights reserved.
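For readers unfamiliar with the metric, the sketch below shows the conventional definition of word error rate: the word-level edit distance (substitutions, deletions, insertions) between a reference and a recognizer hypothesis, divided by the number of reference words. This is the textbook formulation, not code from the study; the abstract does not state how WER or transcription quality was computed, and the function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Conventional WER: word-level Levenshtein distance / reference length.

    Illustrative sketch only; the scoring setup used in the study is not
    described in the abstract.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions are counted, the value can exceed 1.0 when the hypothesis is much longer than the reference.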
Similar resources
Lower WERs do not guarantee better transcriptions
The goal of this paper is to investigate the effect of various properties of the CSR on automatic transcription. To this end, we used various versions of a continuous speech recognizer (CSR) to make automatic transcriptions. Our results show that changing certain properties of the CSR affects the resulting automatic transcriptions. The best results were obtained when ‘short’ hidden Markov model...
Validation of phonetic transcriptions based on recognition performance
In fundamental linguistic as well as in speech technology research there is an increasing need for procedures to automatically generate and validate phonetic transcriptions. Whereas much research has already focussed on the automatic generation of phonetic transcriptions, far less attention has been paid to the validation of such transcriptions. In the little research performed in this a...
Application-oriented validation o preliminary r
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually validat...
Application-oriented validation of phonetic transcriptions: preliminary results
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually val...
Making a difference: On automatic transcription and modeling of Dutch pronunciation variation for automatic speech recognition
The first goal of this study is to investigate the effect of several properties of a continuous speech recognizer (CSR) on automatic phonetic transcription. Our results show that changing certain properties of the CSR affects the resulting automatic transcriptions. The quality of the automatic transcriptions can be improved by using ‘short’ HMMs and by reducing the amount of contami...
Journal title:
- Computer Speech & Language
Volume 18, Issue
Pages -
Publication date: 2004